Dataset statistics
| Number of variables | 28 |
|---|---|
| Number of observations | 11991358 |
| Missing cells | 62632869 |
| Missing cells (%) | 18.7% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.5 GiB |
| Average record size in memory | 224.0 B |
Variable types
| DateTime | 1 |
|---|---|
| Categorical | 4 |
| Text | 5 |
| Numeric | 18 |
ARR_DELAY is highly overall correlated with CANCELLED and 1 other fields | High correlation |
ARR_TIME is highly overall correlated with CANCELLED and 1 other fields | High correlation |
CANCELLATION_CODE is highly overall correlated with CANCELLED and 1 other fields | High correlation |
CANCELLED is highly overall correlated with ARR_DELAY and 8 other fields | High correlation |
CARRIER_DELAY is highly overall correlated with CANCELLED | High correlation |
DEP_DELAY is highly overall correlated with ARR_DELAY | High correlation |
DEP_TIME is highly overall correlated with ARR_TIME | High correlation |
DEST_AIRPORT_ID is highly overall correlated with DEST_AIRPORT_SEQ_ID and 1 other fields | High correlation |
DEST_AIRPORT_SEQ_ID is highly overall correlated with DEST_AIRPORT_ID and 1 other fields | High correlation |
DEST_CITY_MARKET_ID is highly overall correlated with DEST_AIRPORT_ID and 1 other fields | High correlation |
LATE_AIRCRAFT_DELAY is highly overall correlated with CANCELLED | High correlation |
MMYYYY is highly overall correlated with CANCELLATION_CODE | High correlation |
NAS_DELAY is highly overall correlated with CANCELLED | High correlation |
OP_CARRIER is highly overall correlated with OP_UNIQUE_CARRIER | High correlation |
OP_UNIQUE_CARRIER is highly overall correlated with OP_CARRIER | High correlation |
ORIGIN_AIRPORT_ID is highly overall correlated with ORIGIN_AIRPORT_SEQ_ID and 1 other fields | High correlation |
ORIGIN_AIRPORT_SEQ_ID is highly overall correlated with ORIGIN_AIRPORT_ID and 1 other fields | High correlation |
ORIGIN_CITY_MARKET_ID is highly overall correlated with ORIGIN_AIRPORT_ID and 1 other fields | High correlation |
SECURITY_DELAY is highly overall correlated with CANCELLED | High correlation |
TAXI_IN is highly overall correlated with CANCELLED | High correlation |
WEATHER_DELAY is highly overall correlated with CANCELLED | High correlation |
CANCELLED is highly imbalanced (85.8%) | Imbalance |
DEP_TIME has 234621 (2.0%) missing values | Missing |
DEP_DELAY has 234846 (2.0%) missing values | Missing |
TAXI_OUT has 239344 (2.0%) missing values | Missing |
TAXI_IN has 244367 (2.0%) missing values | Missing |
ARR_TIME has 244362 (2.0%) missing values | Missing |
ARR_DELAY has 268767 (2.2%) missing values | Missing |
CANCELLATION_CODE has 11750698 (98.0%) missing values | Missing |
CARRIER_DELAY has 9870493 (82.3%) missing values | Missing |
WEATHER_DELAY has 9870493 (82.3%) missing values | Missing |
NAS_DELAY has 9870493 (82.3%) missing values | Missing |
SECURITY_DELAY has 9870493 (82.3%) missing values | Missing |
LATE_AIRCRAFT_DELAY has 9870493 (82.3%) missing values | Missing |
WEATHER_DELAY is highly skewed (γ1 = 20.82767831) | Skewed |
SECURITY_DELAY is highly skewed (γ1 = 112.0395807) | Skewed |
DEP_DELAY has 592302 (4.9%) zeros | Zeros |
ARR_DELAY has 229257 (1.9%) zeros | Zeros |
CARRIER_DELAY has 986724 (8.2%) zeros | Zeros |
WEATHER_DELAY has 1999251 (16.7%) zeros | Zeros |
NAS_DELAY has 1037152 (8.6%) zeros | Zeros |
SECURITY_DELAY has 2111466 (17.6%) zeros | Zeros |
LATE_AIRCRAFT_DELAY has 1058694 (8.8%) zeros | Zeros |
Reproduction
| Analysis started | 2024-09-20 19:01:24.221822 |
|---|---|
| Analysis finished | 2024-09-20 19:29:33.565584 |
| Duration | 28 minutes and 9.34 seconds |
| Software version | ydata-profiling vv4.10.0 |
| Download configuration | config.json |
FL_DATE
Date
| Distinct | 3653 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 91.5 MiB |
| Minimum | 2014-07-01 00:00:00 |
|---|---|
| Maximum | 2024-06-30 00:00:00 |
OP_UNIQUE_CARRIER
Categorical
HIGH CORRELATION 
| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 91.5 MiB |
| WN | |
|---|---|
| DL | |
| AA | |
| OO | |
| UA | |
| Other values (16) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 23982716 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | AA |
|---|---|
| 2nd row | VX |
| 3rd row | DL |
| 4th row | WN |
| 5th row | EV |
Common Values
| Value | Count | Frequency (%) |
| WN | 2464151 | |
| DL | 1686476 | |
| AA | 1589002 | |
| OO | 1346595 | |
| UA | 1064221 | |
| B6 | 495823 | 4.1% |
| EV | 431775 | 3.6% |
| MQ | 416071 | 3.5% |
| AS | 388825 | 3.2% |
| YX | 360992 | 3.0% |
| Other values (11) | 1747427 |
Length
| Value | Count | Frequency (%) |
| wn | 2464151 | |
| dl | 1686476 | |
| aa | 1589002 | |
| oo | 1346595 | |
| ua | 1064221 | |
| b6 | 495823 | 4.1% |
| ev | 431775 | 3.6% |
| mq | 416071 | 3.5% |
| as | 388825 | 3.2% |
| yx | 360992 | 3.0% |
| Other values (11) | 1747427 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 4769579 | |
| O | 2971266 | |
| N | 2795375 | |
| W | 2464151 | |
| L | 1692532 | 7.1% |
| D | 1686476 | 7.0% |
| U | 1147700 | 4.8% |
| E | 716785 | 3.0% |
| V | 643835 | 2.7% |
| 9 | 523306 | 2.2% |
| Other values (12) | 4571711 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 23982716 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 4769579 | |
| O | 2971266 | |
| N | 2795375 | |
| W | 2464151 | |
| L | 1692532 | 7.1% |
| D | 1686476 | 7.0% |
| U | 1147700 | 4.8% |
| E | 716785 | 3.0% |
| V | 643835 | 2.7% |
| 9 | 523306 | 2.2% |
| Other values (12) | 4571711 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 23982716 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 4769579 | |
| O | 2971266 | |
| N | 2795375 | |
| W | 2464151 | |
| L | 1692532 | 7.1% |
| D | 1686476 | 7.0% |
| U | 1147700 | 4.8% |
| E | 716785 | 3.0% |
| V | 643835 | 2.7% |
| 9 | 523306 | 2.2% |
| Other values (12) | 4571711 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 23982716 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 4769579 | |
| O | 2971266 | |
| N | 2795375 | |
| W | 2464151 | |
| L | 1692532 | 7.1% |
| D | 1686476 | 7.0% |
| U | 1147700 | 4.8% |
| E | 716785 | 3.0% |
| V | 643835 | 2.7% |
| 9 | 523306 | 2.2% |
| Other values (12) | 4571711 |
OP_CARRIER
Categorical
HIGH CORRELATION 
| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 91.5 MiB |
| WN | |
|---|---|
| DL | |
| AA | |
| OO | |
| UA | |
| Other values (16) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 23982716 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | AA |
|---|---|
| 2nd row | VX |
| 3rd row | DL |
| 4th row | WN |
| 5th row | EV |
Common Values
| Value | Count | Frequency (%) |
| WN | 2464151 | |
| DL | 1686476 | |
| AA | 1589002 | |
| OO | 1346595 | |
| UA | 1064221 | |
| B6 | 495823 | 4.1% |
| EV | 431775 | 3.6% |
| MQ | 416071 | 3.5% |
| AS | 388825 | 3.2% |
| YX | 360992 | 3.0% |
| Other values (11) | 1747427 |
Length
| Value | Count | Frequency (%) |
| wn | 2464151 | |
| dl | 1686476 | |
| aa | 1589002 | |
| oo | 1346595 | |
| ua | 1064221 | |
| b6 | 495823 | 4.1% |
| ev | 431775 | 3.6% |
| mq | 416071 | 3.5% |
| as | 388825 | 3.2% |
| yx | 360992 | 3.0% |
| Other values (11) | 1747427 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 4769579 | |
| O | 2971266 | |
| N | 2795375 | |
| W | 2464151 | |
| L | 1692532 | 7.1% |
| D | 1686476 | 7.0% |
| U | 1147700 | 4.8% |
| E | 716785 | 3.0% |
| V | 643835 | 2.7% |
| 9 | 523306 | 2.2% |
| Other values (12) | 4571711 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 23982716 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 4769579 | |
| O | 2971266 | |
| N | 2795375 | |
| W | 2464151 | |
| L | 1692532 | 7.1% |
| D | 1686476 | 7.0% |
| U | 1147700 | 4.8% |
| E | 716785 | 3.0% |
| V | 643835 | 2.7% |
| 9 | 523306 | 2.2% |
| Other values (12) | 4571711 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 23982716 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 4769579 | |
| O | 2971266 | |
| N | 2795375 | |
| W | 2464151 | |
| L | 1692532 | 7.1% |
| D | 1686476 | 7.0% |
| U | 1147700 | 4.8% |
| E | 716785 | 3.0% |
| V | 643835 | 2.7% |
| 9 | 523306 | 2.2% |
| Other values (12) | 4571711 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 23982716 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 4769579 | |
| O | 2971266 | |
| N | 2795375 | |
| W | 2464151 | |
| L | 1692532 | 7.1% |
| D | 1686476 | 7.0% |
| U | 1147700 | 4.8% |
| E | 716785 | 3.0% |
| V | 643835 | 2.7% |
| 9 | 523306 | 2.2% |
| Other values (12) | 4571711 |
TAIL_NUM
Text
| Distinct | 9449 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 63399 |
| Missing (%) | 0.5% |
| Memory size | 91.5 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.9864892 |
| Min length | 3 |
Characters and Unicode
| Total characters | 71406598 |
|---|---|
| Distinct characters | 34 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 22 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | N3GAAA |
|---|---|
| 2nd row | N634VA |
| 3rd row | N974DL |
| 4th row | N924WN |
| 5th row | N691CA |
| Value | Count | Frequency (%) |
| n493ha | 5807 | < 0.1% |
| n480ha | 5800 | < 0.1% |
| n491ha | 5694 | < 0.1% |
| n484ha | 5667 | < 0.1% |
| n486ha | 5663 | < 0.1% |
| n492ha | 5652 | < 0.1% |
| n487ha | 5509 | < 0.1% |
| n476ha | 5466 | < 0.1% |
| n479ha | 5452 | < 0.1% |
| n483ha | 5451 | < 0.1% |
| Other values (9439) | 11871798 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 15285084 | |
| 8 | 4371608 | 6.1% |
| 3 | 4264972 | 6.0% |
| 7 | 4187773 | 5.9% |
| 2 | 4060034 | 5.7% |
| 9 | 4026712 | 5.6% |
| 6 | 3812001 | 5.3% |
| 1 | 3741400 | 5.2% |
| 5 | 3737769 | 5.2% |
| 4 | 3702938 | 5.2% |
| Other values (24) | 20216307 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 71406598 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| N | 15285084 | |
| 8 | 4371608 | 6.1% |
| 3 | 4264972 | 6.0% |
| 7 | 4187773 | 5.9% |
| 2 | 4060034 | 5.7% |
| 9 | 4026712 | 5.6% |
| 6 | 3812001 | 5.3% |
| 1 | 3741400 | 5.2% |
| 5 | 3737769 | 5.2% |
| 4 | 3702938 | 5.2% |
| Other values (24) | 20216307 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 71406598 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| N | 15285084 | |
| 8 | 4371608 | 6.1% |
| 3 | 4264972 | 6.0% |
| 7 | 4187773 | 5.9% |
| 2 | 4060034 | 5.7% |
| 9 | 4026712 | 5.6% |
| 6 | 3812001 | 5.3% |
| 1 | 3741400 | 5.2% |
| 5 | 3737769 | 5.2% |
| 4 | 3702938 | 5.2% |
| Other values (24) | 20216307 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 71406598 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| N | 15285084 | |
| 8 | 4371608 | 6.1% |
| 3 | 4264972 | 6.0% |
| 7 | 4187773 | 5.9% |
| 2 | 4060034 | 5.7% |
| 9 | 4026712 | 5.6% |
| 6 | 3812001 | 5.3% |
| 1 | 3741400 | 5.2% |
| 5 | 3737769 | 5.2% |
| 4 | 3702938 | 5.2% |
| Other values (24) | 20216307 |
ORIGIN_AIRPORT_ID
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 387 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12661.256 |
| Minimum | 10135 |
|---|---|
| Maximum | 16869 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 91.5 MiB |
Quantile statistics
| Minimum | 10135 |
|---|---|
| 5-th percentile | 10397 |
| Q1 | 11292 |
| median | 12889 |
| Q3 | 14027 |
| 95-th percentile | 14893 |
| Maximum | 16869 |
| Range | 6734 |
| Interquartile range (IQR) | 2735 |
Descriptive statistics
| Standard deviation | 1530.5848 |
|---|---|
| Coefficient of variation (CV) | 0.12088728 |
| Kurtosis | -1.3072117 |
| Mean | 12661.256 |
| Median Absolute Deviation (MAD) | 1568 |
| Skewness | 0.082049793 |
| Sum | 1.5182565 × 1011 |
| Variance | 2342689.8 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10397 | 677160 | 5.6% |
| 13930 | 524184 | 4.4% |
| 11298 | 500590 | 4.2% |
| 11292 | 472741 | 3.9% |
| 12892 | 379151 | 3.2% |
| 11057 | 330627 | 2.8% |
| 14107 | 314020 | 2.6% |
| 12889 | 300032 | 2.5% |
| 14771 | 281090 | 2.3% |
| 14747 | 274205 | 2.3% |
| Other values (377) | 7937558 |
| Value | Count | Frequency (%) |
| 10135 | 6929 | 0.1% |
| 10136 | 3309 | < 0.1% |
| 10140 | 39899 | |
| 10141 | 1457 | < 0.1% |
| 10146 | 1732 | < 0.1% |
| 10154 | 1848 | < 0.1% |
| 10155 | 2693 | < 0.1% |
| 10157 | 3087 | < 0.1% |
| 10158 | 5915 | < 0.1% |
| 10165 | 224 | < 0.1% |
| Value | Count | Frequency (%) |
| 16869 | 1121 | < 0.1% |
| 16218 | 3452 | < 0.1% |
| 16133 | 1 | < 0.1% |
| 16101 | 261 | < 0.1% |
| 15991 | 1426 | < 0.1% |
| 15919 | 18604 | |
| 15897 | 500 | < 0.1% |
| 15841 | 1428 | < 0.1% |
| 15624 | 14409 | |
| 15607 | 1813 | < 0.1% |
ORIGIN_AIRPORT_SEQ_ID
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 760 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1266129.1 |
| Minimum | 1013503 |
|---|---|
| Maximum | 1686902 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 91.5 MiB |
Quantile statistics
| Minimum | 1013503 |
|---|---|
| 5-th percentile | 1039707 |
| Q1 | 1129202 |
| median | 1288903 |
| Q3 | 1402702 |
| 95-th percentile | 1489302 |
| Maximum | 1686902 |
| Range | 673399 |
| Interquartile range (IQR) | 273500 |
Descriptive statistics
| Standard deviation | 153058.22 |
|---|---|
| Coefficient of variation (CV) | 0.12088674 |
| Kurtosis | -1.3072166 |
| Mean | 1266129.1 |
| Median Absolute Deviation (MAD) | 156799 |
| Skewness | 0.082052193 |
| Sum | 1.5182608 × 1013 |
| Variance | 2.3426818 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1129202 | 472741 | 3.9% |
| 1039707 | 408332 | 3.4% |
| 1129806 | 341157 | 2.8% |
| 1105703 | 330627 | 2.8% |
| 1410702 | 314020 | 2.6% |
| 1474703 | 274205 | 2.3% |
| 1226603 | 268259 | 2.2% |
| 1039705 | 262589 | 2.2% |
| 1320402 | 257213 | 2.1% |
| 1143302 | 250263 | 2.1% |
| Other values (750) | 8811952 |
| Value | Count | Frequency (%) |
| 1013503 | 1547 | < 0.1% |
| 1013504 | 52 | < 0.1% |
| 1013505 | 1461 | < 0.1% |
| 1013506 | 3869 | < 0.1% |
| 1013603 | 3309 | < 0.1% |
| 1014003 | 14212 | |
| 1014004 | 343 | < 0.1% |
| 1014005 | 25344 | |
| 1014103 | 396 | < 0.1% |
| 1014104 | 97 | < 0.1% |
| Value | Count | Frequency (%) |
| 1686902 | 479 | < 0.1% |
| 1686901 | 642 | < 0.1% |
| 1621802 | 1852 | < 0.1% |
| 1621801 | 1600 | < 0.1% |
| 1613305 | 1 | < 0.1% |
| 1610102 | 261 | < 0.1% |
| 1599102 | 1426 | < 0.1% |
| 1591905 | 3604 | < 0.1% |
| 1591904 | 9677 | |
| 1591903 | 89 | < 0.1% |
ORIGIN_CITY_MARKET_ID
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 361 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 31730.829 |
| Minimum | 30070 |
|---|---|
| Maximum | 36133 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 91.5 MiB |
Quantile statistics
| Minimum | 30070 |
|---|---|
| 5-th percentile | 30194 |
| Q1 | 30647 |
| median | 31453 |
| Q3 | 32467 |
| 95-th percentile | 34524 |
| Maximum | 36133 |
| Range | 6063 |
| Interquartile range (IQR) | 1820 |
Descriptive statistics
| Standard deviation | 1306.2894 |
|---|---|
| Coefficient of variation (CV) | 0.041167832 |
| Kurtosis | -0.25314046 |
| Mean | 31730.829 |
| Median Absolute Deviation (MAD) | 987 |
| Skewness | 0.82282807 |
| Sum | 3.8049573 × 1011 |
| Variance | 1706392.1 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 31703 | 685923 | 5.7% |
| 30977 | 682009 | 5.7% |
| 30397 | 677160 | 5.6% |
| 30194 | 629600 | 5.3% |
| 32575 | 570201 | 4.8% |
| 30325 | 472741 | 3.9% |
| 30852 | 469210 | 3.9% |
| 32457 | 459369 | 3.8% |
| 31453 | 373617 | 3.1% |
| 31057 | 330627 | 2.8% |
| Other values (351) | 6640901 |
| Value | Count | Frequency (%) |
| 30070 | 1294 | < 0.1% |
| 30073 | 1392 | < 0.1% |
| 30082 | 71 | < 0.1% |
| 30107 | 1273 | < 0.1% |
| 30113 | 1551 | < 0.1% |
| 30135 | 6929 | 0.1% |
| 30136 | 3309 | < 0.1% |
| 30140 | 39899 | |
| 30141 | 1457 | < 0.1% |
| 30146 | 1732 | < 0.1% |
| Value | Count | Frequency (%) |
| 36133 | 1 | < 0.1% |
| 36101 | 261 | < 0.1% |
| 35991 | 1426 | |
| 35897 | 500 | < 0.1% |
| 35841 | 1428 | |
| 35582 | 771 | |
| 35569 | 416 | < 0.1% |
| 35550 | 1813 | |
| 35497 | 106 | < 0.1% |
| 35454 | 264 | < 0.1% |
ORIGIN
Text
| Distinct | 387 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 91.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 35974074 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | MIA |
|---|---|
| 2nd row | JFK |
| 3rd row | MCO |
| 4th row | LAX |
| 5th row | ATL |
| Value | Count | Frequency (%) |
| atl | 677160 | 5.6% |
| ord | 524184 | 4.4% |
| dfw | 500590 | 4.2% |
| den | 472741 | 3.9% |
| lax | 379151 | 3.2% |
| clt | 330627 | 2.8% |
| phx | 314020 | 2.6% |
| las | 300032 | 2.5% |
| sfo | 281090 | 2.3% |
| sea | 274205 | 2.3% |
| Other values (377) | 7937558 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 4065594 | 11.3% |
| L | 3392977 | 9.4% |
| S | 3025997 | 8.4% |
| D | 2835688 | 7.9% |
| T | 2015900 | 5.6% |
| O | 1913255 | 5.3% |
| C | 1742869 | 4.8% |
| M | 1605987 | 4.5% |
| F | 1508770 | 4.2% |
| W | 1446960 | 4.0% |
| Other values (16) | 12420077 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 35974074 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 4065594 | 11.3% |
| L | 3392977 | 9.4% |
| S | 3025997 | 8.4% |
| D | 2835688 | 7.9% |
| T | 2015900 | 5.6% |
| O | 1913255 | 5.3% |
| C | 1742869 | 4.8% |
| M | 1605987 | 4.5% |
| F | 1508770 | 4.2% |
| W | 1446960 | 4.0% |
| Other values (16) | 12420077 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 35974074 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 4065594 | 11.3% |
| L | 3392977 | 9.4% |
| S | 3025997 | 8.4% |
| D | 2835688 | 7.9% |
| T | 2015900 | 5.6% |
| O | 1913255 | 5.3% |
| C | 1742869 | 4.8% |
| M | 1605987 | 4.5% |
| F | 1508770 | 4.2% |
| W | 1446960 | 4.0% |
| Other values (16) | 12420077 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 35974074 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 4065594 | 11.3% |
| L | 3392977 | 9.4% |
| S | 3025997 | 8.4% |
| D | 2835688 | 7.9% |
| T | 2015900 | 5.6% |
| O | 1913255 | 5.3% |
| C | 1742869 | 4.8% |
| M | 1605987 | 4.5% |
| F | 1508770 | 4.2% |
| W | 1446960 | 4.0% |
| Other values (16) | 12420077 |
ORIGIN_CITY_NAME
Text
| Distinct | 379 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 91.5 MiB |
Length
| Max length | 34 |
|---|---|
| Median length | 29 |
| Mean length | 13.081675 |
| Min length | 8 |
Characters and Unicode
| Total characters | 156867047 |
|---|---|
| Distinct characters | 58 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Miami, FL |
|---|---|
| 2nd row | New York, NY |
| 3rd row | Orlando, FL |
| 4th row | Los Angeles, CA |
| 5th row | Atlanta, GA |
| Value | Count | Frequency (%) |
| ca | 1356866 | 4.9% |
| tx | 1302483 | 4.7% |
| fl | 1001524 | 3.6% |
| ga | 717105 | 2.6% |
| il | 710063 | 2.5% |
| chicago | 682009 | 2.4% |
| atlanta | 677160 | 2.4% |
| san | 654152 | 2.3% |
| ny | 565378 | 2.0% |
| co | 527970 | 1.9% |
| Other values (456) | 19769310 |
Most occurring characters
| Value | Count | Frequency (%) |
| 15972662 | 10.2% | |
| a | 12090028 | 7.7% |
| , | 11991358 | 7.6% |
| o | 8774610 | 5.6% |
| e | 8182016 | 5.2% |
| n | 7722499 | 4.9% |
| t | 7686368 | 4.9% |
| l | 6941152 | 4.4% |
| i | 5993030 | 3.8% |
| r | 5573961 | 3.6% |
| Other values (48) | 65939363 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 156867047 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 15972662 | 10.2% | |
| a | 12090028 | 7.7% |
| , | 11991358 | 7.6% |
| o | 8774610 | 5.6% |
| e | 8182016 | 5.2% |
| n | 7722499 | 4.9% |
| t | 7686368 | 4.9% |
| l | 6941152 | 4.4% |
| i | 5993030 | 3.8% |
| r | 5573961 | 3.6% |
| Other values (48) | 65939363 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 156867047 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 15972662 | 10.2% | |
| a | 12090028 | 7.7% |
| , | 11991358 | 7.6% |
| o | 8774610 | 5.6% |
| e | 8182016 | 5.2% |
| n | 7722499 | 4.9% |
| t | 7686368 | 4.9% |
| l | 6941152 | 4.4% |
| i | 5993030 | 3.8% |
| r | 5573961 | 3.6% |
| Other values (48) | 65939363 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 156867047 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 15972662 | 10.2% | |
| a | 12090028 | 7.7% |
| , | 11991358 | 7.6% |
| o | 8774610 | 5.6% |
| e | 8182016 | 5.2% |
| n | 7722499 | 4.9% |
| t | 7686368 | 4.9% |
| l | 6941152 | 4.4% |
| i | 5993030 | 3.8% |
| r | 5573961 | 3.6% |
| Other values (48) | 65939363 |
DEST_AIRPORT_ID
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 387 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12660.67 |
| Minimum | 10135 |
|---|---|
| Maximum | 16869 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 91.5 MiB |
Quantile statistics
| Minimum | 10135 |
|---|---|
| 5-th percentile | 10397 |
| Q1 | 11292 |
| median | 12889 |
| Q3 | 14027 |
| 95-th percentile | 14893 |
| Maximum | 16869 |
| Range | 6734 |
| Interquartile range (IQR) | 2735 |
Descriptive statistics
| Standard deviation | 1530.3726 |
|---|---|
| Coefficient of variation (CV) | 0.12087612 |
| Kurtosis | -1.3058999 |
| Mean | 12660.67 |
| Median Absolute Deviation (MAD) | 1568 |
| Skewness | 0.083446859 |
| Sum | 1.5181862 × 1011 |
| Variance | 2342040.3 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10397 | 677146 | 5.6% |
| 13930 | 523625 | 4.4% |
| 11298 | 501097 | 4.2% |
| 11292 | 473432 | 3.9% |
| 12892 | 379305 | 3.2% |
| 11057 | 331170 | 2.8% |
| 14107 | 314299 | 2.6% |
| 12889 | 299584 | 2.5% |
| 14771 | 280522 | 2.3% |
| 14747 | 273070 | 2.3% |
| Other values (377) | 7938108 |
| Value | Count | Frequency (%) |
| 10135 | 7024 | 0.1% |
| 10136 | 3260 | < 0.1% |
| 10140 | 39909 | |
| 10141 | 1491 | < 0.1% |
| 10146 | 1776 | < 0.1% |
| 10154 | 1813 | < 0.1% |
| 10155 | 2733 | < 0.1% |
| 10157 | 3042 | < 0.1% |
| 10158 | 5975 | < 0.1% |
| 10165 | 197 | < 0.1% |
| Value | Count | Frequency (%) |
| 16869 | 1200 | < 0.1% |
| 16218 | 3391 | < 0.1% |
| 16133 | 2 | < 0.1% |
| 16101 | 262 | < 0.1% |
| 15991 | 1481 | < 0.1% |
| 15919 | 18630 | |
| 15897 | 536 | < 0.1% |
| 15841 | 1465 | < 0.1% |
| 15624 | 14367 | |
| 15607 | 1847 | < 0.1% |
DEST_AIRPORT_SEQ_ID
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 758 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1266070.5 |
| Minimum | 1013503 |
|---|---|
| Maximum | 1686902 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 91.5 MiB |
Quantile statistics
| Minimum | 1013503 |
|---|---|
| 5-th percentile | 1039707 |
| Q1 | 1129202 |
| median | 1288903 |
| Q3 | 1402702 |
| 95-th percentile | 1489302 |
| Maximum | 1686902 |
| Range | 673399 |
| Interquartile range (IQR) | 273500 |
Descriptive statistics
| Standard deviation | 153037 |
|---|---|
| Coefficient of variation (CV) | 0.12087558 |
| Kurtosis | -1.3059048 |
| Mean | 1266070.5 |
| Median Absolute Deviation (MAD) | 156799 |
| Skewness | 0.083449257 |
| Sum | 1.5181904 × 1013 |
| Variance | 2.3420323 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1129202 | 473432 | 3.9% |
| 1039707 | 407930 | 3.4% |
| 1129806 | 341641 | 2.8% |
| 1105703 | 331170 | 2.8% |
| 1410702 | 314299 | 2.6% |
| 1474703 | 273070 | 2.3% |
| 1226603 | 268983 | 2.2% |
| 1039705 | 262879 | 2.2% |
| 1320402 | 257836 | 2.2% |
| 1143302 | 251283 | 2.1% |
| Other values (748) | 8808835 |
| Value | Count | Frequency (%) |
| 1013503 | 1546 | < 0.1% |
| 1013504 | 57 | < 0.1% |
| 1013505 | 1499 | < 0.1% |
| 1013506 | 3922 | < 0.1% |
| 1013603 | 3260 | < 0.1% |
| 1014003 | 14438 | |
| 1014004 | 318 | < 0.1% |
| 1014005 | 25153 | |
| 1014103 | 432 | < 0.1% |
| 1014104 | 113 | < 0.1% |
| Value | Count | Frequency (%) |
| 1686902 | 554 | < 0.1% |
| 1686901 | 646 | < 0.1% |
| 1621802 | 1809 | < 0.1% |
| 1621801 | 1582 | < 0.1% |
| 1613305 | 2 | < 0.1% |
| 1610102 | 262 | < 0.1% |
| 1599102 | 1481 | < 0.1% |
| 1591905 | 3504 | < 0.1% |
| 1591904 | 9715 | |
| 1591903 | 75 | < 0.1% |
DEST_CITY_MARKET_ID
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 361 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 31730.473 |
| Minimum | 30070 |
|---|---|
| Maximum | 36133 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 91.5 MiB |
Quantile statistics
| Minimum | 30070 |
|---|---|
| 5-th percentile | 30194 |
| Q1 | 30647 |
| median | 31453 |
| Q3 | 32467 |
| 95-th percentile | 34524 |
| Maximum | 36133 |
| Range | 6063 |
| Interquartile range (IQR) | 1820 |
Descriptive statistics
| Standard deviation | 1305.9278 |
|---|---|
| Coefficient of variation (CV) | 0.041156897 |
| Kurtosis | -0.24865616 |
| Mean | 31730.473 |
| Median Absolute Deviation (MAD) | 987 |
| Skewness | 0.82394087 |
| Sum | 3.8049147 × 1011 |
| Variance | 1705447.5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 31703 | 686423 | 5.7% |
| 30977 | 680709 | 5.7% |
| 30397 | 677146 | 5.6% |
| 30194 | 630025 | 5.3% |
| 32575 | 570790 | 4.8% |
| 30325 | 473432 | 3.9% |
| 30852 | 469977 | 3.9% |
| 32457 | 459179 | 3.8% |
| 31453 | 374128 | 3.1% |
| 31057 | 331170 | 2.8% |
| Other values (351) | 6638379 |
| Value | Count | Frequency (%) |
| 30070 | 1309 | < 0.1% |
| 30073 | 1396 | < 0.1% |
| 30082 | 71 | < 0.1% |
| 30107 | 1251 | < 0.1% |
| 30113 | 1553 | < 0.1% |
| 30135 | 7024 | 0.1% |
| 30136 | 3260 | < 0.1% |
| 30140 | 39909 | |
| 30141 | 1491 | < 0.1% |
| 30146 | 1776 | < 0.1% |
| Value | Count | Frequency (%) |
| 36133 | 2 | < 0.1% |
| 36101 | 262 | < 0.1% |
| 35991 | 1481 | |
| 35897 | 536 | < 0.1% |
| 35841 | 1465 | |
| 35582 | 732 | < 0.1% |
| 35569 | 395 | < 0.1% |
| 35550 | 1847 | |
| 35497 | 108 | < 0.1% |
| 35454 | 261 | < 0.1% |
DEST
Text
| Distinct | 387 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 91.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 35974074 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PHX |
|---|---|
| 2nd row | SFO |
| 3rd row | LGA |
| 4th row | STL |
| 5th row | XNA |
| Value | Count | Frequency (%) |
| atl | 677146 | 5.6% |
| ord | 523625 | 4.4% |
| dfw | 501097 | 4.2% |
| den | 473432 | 3.9% |
| lax | 379305 | 3.2% |
| clt | 331170 | 2.8% |
| phx | 314299 | 2.6% |
| las | 299584 | 2.5% |
| sfo | 280522 | 2.3% |
| sea | 273070 | 2.3% |
| Other values (377) | 7938108 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 4064902 | 11.3% |
| L | 3394790 | 9.4% |
| S | 3023811 | 8.4% |
| D | 2836627 | 7.9% |
| T | 2018906 | 5.6% |
| O | 1911137 | 5.3% |
| C | 1741782 | 4.8% |
| M | 1606190 | 4.5% |
| F | 1510391 | 4.2% |
| W | 1447098 | 4.0% |
| Other values (16) | 12418440 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 35974074 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 4064902 | 11.3% |
| L | 3394790 | 9.4% |
| S | 3023811 | 8.4% |
| D | 2836627 | 7.9% |
| T | 2018906 | 5.6% |
| O | 1911137 | 5.3% |
| C | 1741782 | 4.8% |
| M | 1606190 | 4.5% |
| F | 1510391 | 4.2% |
| W | 1447098 | 4.0% |
| Other values (16) | 12418440 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 35974074 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 4064902 | 11.3% |
| L | 3394790 | 9.4% |
| S | 3023811 | 8.4% |
| D | 2836627 | 7.9% |
| T | 2018906 | 5.6% |
| O | 1911137 | 5.3% |
| C | 1741782 | 4.8% |
| M | 1606190 | 4.5% |
| F | 1510391 | 4.2% |
| W | 1447098 | 4.0% |
| Other values (16) | 12418440 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 35974074 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 4064902 | 11.3% |
| L | 3394790 | 9.4% |
| S | 3023811 | 8.4% |
| D | 2836627 | 7.9% |
| T | 2018906 | 5.6% |
| O | 1911137 | 5.3% |
| C | 1741782 | 4.8% |
| M | 1606190 | 4.5% |
| F | 1510391 | 4.2% |
| W | 1447098 | 4.0% |
| Other values (16) | 12418440 |
DEST_CITY_NAME
Text
| Distinct | 379 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 91.5 MiB |
Length
| Max length | 34 |
|---|---|
| Median length | 29 |
| Mean length | 13.080979 |
| Min length | 8 |
Characters and Unicode
| Total characters | 156858707 |
|---|---|
| Distinct characters | 58 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Phoenix, AZ |
|---|---|
| 2nd row | San Francisco, CA |
| 3rd row | New York, NY |
| 4th row | St. Louis, MO |
| 5th row | Fayetteville, AR |
| Value | Count | Frequency (%) |
| ca | 1357460 | 4.9% |
| tx | 1303577 | 4.7% |
| fl | 1002829 | 3.6% |
| ga | 716987 | 2.6% |
| il | 708552 | 2.5% |
| chicago | 680709 | 2.4% |
| atlanta | 677146 | 2.4% |
| san | 653417 | 2.3% |
| ny | 565989 | 2.0% |
| co | 528129 | 1.9% |
| Other values (456) | 19769892 |
Most occurring characters
| Value | Count | Frequency (%) |
| 15973329 | 10.2% | |
| a | 12084815 | 7.7% |
| , | 11991358 | 7.6% |
| o | 8780644 | 5.6% |
| e | 8182712 | 5.2% |
| n | 7723542 | 4.9% |
| t | 7690934 | 4.9% |
| l | 6939993 | 4.4% |
| i | 5986361 | 3.8% |
| r | 5577240 | 3.6% |
| Other values (48) | 65927779 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 156858707 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 15973329 | 10.2% | |
| a | 12084815 | 7.7% |
| , | 11991358 | 7.6% |
| o | 8780644 | 5.6% |
| e | 8182712 | 5.2% |
| n | 7723542 | 4.9% |
| t | 7690934 | 4.9% |
| l | 6939993 | 4.4% |
| i | 5986361 | 3.8% |
| r | 5577240 | 3.6% |
| Other values (48) | 65927779 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 156858707 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 15973329 | 10.2% | |
| a | 12084815 | 7.7% |
| , | 11991358 | 7.6% |
| o | 8780644 | 5.6% |
| e | 8182712 | 5.2% |
| n | 7723542 | 4.9% |
| t | 7690934 | 4.9% |
| l | 6939993 | 4.4% |
| i | 5986361 | 3.8% |
| r | 5577240 | 3.6% |
| Other values (48) | 65927779 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 156858707 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 15973329 | 10.2% | |
| a | 12084815 | 7.7% |
| , | 11991358 | 7.6% |
| o | 8780644 | 5.6% |
| e | 8182712 | 5.2% |
| n | 7723542 | 4.9% |
| t | 7690934 | 4.9% |
| l | 6939993 | 4.4% |
| i | 5986361 | 3.8% |
| r | 5577240 | 3.6% |
| Other values (48) | 65927779 |
DEP_TIME
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 1440 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 234621 |
| Missing (%) | 2.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1330.4342 |
| Minimum | 1 |
|---|---|
| Maximum | 2400 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 91.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 604 |
| Q1 | 917 |
| median | 1324 |
| Q3 | 1738 |
| 95-th percentile | 2131 |
| Maximum | 2400 |
| Range | 2399 |
| Interquartile range (IQR) | 821 |
Descriptive statistics
| Standard deviation | 497.87618 |
|---|---|
| Coefficient of variation (CV) | 0.37422082 |
| Kurtosis | -0.96831036 |
| Mean | 1330.4342 |
| Median Absolute Deviation (MAD) | 411 |
| Skewness | 0.037810536 |
| Sum | 1.5641565 × 1010 |
| Variance | 247880.69 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 555 | 30575 | 0.3% |
| 556 | 28303 | 0.2% |
| 557 | 27930 | 0.2% |
| 655 | 26353 | 0.2% |
| 558 | 26125 | 0.2% |
| 554 | 24417 | 0.2% |
| 656 | 24408 | 0.2% |
| 559 | 24169 | 0.2% |
| 657 | 23581 | 0.2% |
| 600 | 22390 | 0.2% |
| Other values (1430) | 11498486 | |
| (Missing) | 234621 | 2.0% |
| Value | Count | Frequency (%) |
| 1 | 1432 | |
| 2 | 1139 | |
| 3 | 1092 | |
| 4 | 1026 | |
| 5 | 981 | |
| 6 | 959 | |
| 7 | 990 | |
| 8 | 963 | |
| 9 | 944 | |
| 10 | 884 |
| Value | Count | Frequency (%) |
| 2400 | 971 | |
| 2359 | 1617 | |
| 2358 | 1593 | |
| 2357 | 1655 | |
| 2356 | 1839 | |
| 2355 | 2004 | |
| 2354 | 2043 | |
| 2353 | 2131 | |
| 2352 | 2092 | |
| 2351 | 2010 |
DEP_DELAY
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 1782 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 234846 |
| Missing (%) | 2.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.5235726 |
| Minimum | -234 |
|---|---|
| Maximum | 3360 |
| Zeros | 592302 |
| Zeros (%) | 4.9% |
| Negative | 7148564 |
| Negative (%) | 59.6% |
| Memory size | 91.5 MiB |
Quantile statistics
| Minimum | -234 |
|---|---|
| 5-th percentile | -10 |
| Q1 | -5 |
| median | -2 |
| Q3 | 6 |
| 95-th percentile | 69 |
| Maximum | 3360 |
| Range | 3594 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 45.714361 |
|---|---|
| Coefficient of variation (CV) | 4.8001273 |
| Kurtosis | 242.38101 |
| Mean | 9.5235726 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 11.203295 |
| Sum | 1.11964 × 108 |
| Variance | 2089.8028 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -5 | 953455 | 8.0% |
| -4 | 908498 | 7.6% |
| -3 | 883157 | 7.4% |
| -2 | 802595 | 6.7% |
| -6 | 742406 | 6.2% |
| -1 | 703421 | 5.9% |
| -7 | 597566 | 5.0% |
| 0 | 592302 | 4.9% |
| -8 | 457155 | 3.8% |
| -9 | 336325 | 2.8% |
| Other values (1772) | 4779632 |
| Value | Count | Frequency (%) |
| -234 | 1 | |
| -204 | 1 | |
| -201 | 1 | |
| -151 | 1 | |
| -130 | 1 | |
| -112 | 1 | |
| -102 | 1 | |
| -96 | 1 | |
| -92 | 1 | |
| -91 | 1 |
| Value | Count | Frequency (%) |
| 3360 | 1 | |
| 3343 | 1 | |
| 3095 | 1 | |
| 3051 | 1 | |
| 3011 | 1 | |
| 2994 | 1 | |
| 2895 | 1 | |
| 2871 | 1 | |
| 2816 | 1 | |
| 2765 | 1 |
TAXI_OUT
Real number (ℝ)
MISSING 
| Distinct | 189 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 239344 |
| Missing (%) | 2.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.570201 |
| Minimum | 1 |
|---|---|
| Maximum | 256 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 91.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 8 |
| Q1 | 11 |
| median | 14 |
| Q3 | 19 |
| 95-th percentile | 33 |
| Maximum | 256 |
| Range | 255 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 9.1586671 |
|---|---|
| Coefficient of variation (CV) | 0.55271913 |
| Kurtosis | 22.843441 |
| Mean | 16.570201 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 3.3948698 |
| Sum | 1.9473324 × 108 |
| Variance | 83.881183 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12 | 962702 | 8.0% |
| 11 | 942237 | 7.9% |
| 13 | 923454 | 7.7% |
| 10 | 847067 | 7.1% |
| 14 | 846288 | 7.1% |
| 15 | 753946 | 6.3% |
| 9 | 671912 | 5.6% |
| 16 | 656378 | 5.5% |
| 17 | 565482 | 4.7% |
| 18 | 485361 | 4.0% |
| Other values (179) | 4097187 |
| Value | Count | Frequency (%) |
| 1 | 293 | < 0.1% |
| 2 | 462 | < 0.1% |
| 3 | 2667 | < 0.1% |
| 4 | 9224 | 0.1% |
| 5 | 32899 | 0.3% |
| 6 | 112978 | 0.9% |
| 7 | 258397 | 2.2% |
| 8 | 455479 | |
| 9 | 671912 | |
| 10 | 847067 |
| Value | Count | Frequency (%) |
| 256 | 1 | < 0.1% |
| 213 | 1 | < 0.1% |
| 201 | 1 | < 0.1% |
| 198 | 1 | < 0.1% |
| 188 | 1 | < 0.1% |
| 186 | 1 | < 0.1% |
| 183 | 4 | |
| 182 | 3 | |
| 181 | 1 | < 0.1% |
| 180 | 1 | < 0.1% |
TAXI_IN
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 234 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 244367 |
| Missing (%) | 2.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.5612734 |
| Minimum | 1 |
|---|---|
| Maximum | 1426 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 91.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 4 |
| median | 6 |
| Q3 | 9 |
| 95-th percentile | 17 |
| Maximum | 1426 |
| Range | 1425 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 6.0072554 |
|---|---|
| Coefficient of variation (CV) | 0.79447669 |
| Kurtosis | 335.58966 |
| Mean | 7.5612734 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 6.2853995 |
| Sum | 88822210 |
| Variance | 36.087118 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 1842450 | |
| 5 | 1774571 | |
| 6 | 1447096 | |
| 3 | 1205823 | |
| 7 | 1130520 | |
| 8 | 839388 | |
| 9 | 641651 | 5.4% |
| 10 | 495265 | 4.1% |
| 11 | 380283 | 3.2% |
| 2 | 331134 | 2.8% |
| Other values (224) | 1658810 |
| Value | Count | Frequency (%) |
| 1 | 19561 | 0.2% |
| 2 | 331134 | 2.8% |
| 3 | 1205823 | |
| 4 | 1842450 | |
| 5 | 1774571 | |
| 6 | 1447096 | |
| 7 | 1130520 | |
| 8 | 839388 | |
| 9 | 641651 | 5.4% |
| 10 | 495265 | 4.1% |
| Value | Count | Frequency (%) |
| 1426 | 1 | |
| 414 | 1 | |
| 400 | 1 | |
| 399 | 2 | |
| 397 | 1 | |
| 365 | 1 | |
| 349 | 1 | |
| 344 | 1 | |
| 341 | 1 | |
| 315 | 1 |
ARR_TIME
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 1440 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 244362 |
| Missing (%) | 2.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1470.5384 |
| Minimum | 1 |
|---|---|
| Maximum | 2400 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 91.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 656 |
| Q1 | 1055 |
| median | 1508 |
| Q3 | 1914 |
| 95-th percentile | 2248 |
| Maximum | 2400 |
| Range | 2399 |
| Interquartile range (IQR) | 859 |
Descriptive statistics
| Standard deviation | 529.03114 |
|---|---|
| Coefficient of variation (CV) | 0.35975336 |
| Kurtosis | -0.36810708 |
| Mean | 1470.5384 |
| Median Absolute Deviation (MAD) | 410 |
| Skewness | -0.35553999 |
| Sum | 1.7274409 × 1010 |
| Variance | 279873.94 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1643 | 13038 | 0.1% |
| 1641 | 13008 | 0.1% |
| 1646 | 12926 | 0.1% |
| 1634 | 12921 | 0.1% |
| 1638 | 12910 | 0.1% |
| 1628 | 12902 | 0.1% |
| 1630 | 12898 | 0.1% |
| 1645 | 12871 | 0.1% |
| 1635 | 12867 | 0.1% |
| 1637 | 12827 | 0.1% |
| Other values (1430) | 11617828 | |
| (Missing) | 244362 | 2.0% |
| Value | Count | Frequency (%) |
| 1 | 6484 | |
| 2 | 5641 | |
| 3 | 5638 | |
| 4 | 5410 | |
| 5 | 5401 | |
| 6 | 5196 | |
| 7 | 5250 | |
| 8 | 4977 | |
| 9 | 4897 | |
| 10 | 4881 |
| Value | Count | Frequency (%) |
| 2400 | 5456 | |
| 2359 | 6030 | |
| 2358 | 6237 | |
| 2357 | 6524 | |
| 2356 | 6561 | |
| 2355 | 6739 | |
| 2354 | 6835 | |
| 2353 | 6953 | |
| 2352 | 7128 | |
| 2351 | 7272 |
ARR_DELAY
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 1794 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 268767 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.9905049 |
| Minimum | -235 |
|---|---|
| Maximum | 3359 |
| Zeros | 229257 |
| Zeros (%) | 1.9% |
| Negative | 7497114 |
| Negative (%) | 62.5% |
| Memory size | 91.5 MiB |
Quantile statistics
| Minimum | -235 |
|---|---|
| 5-th percentile | -27 |
| Q1 | -15 |
| median | -6 |
| Q3 | 7 |
| 95-th percentile | 68 |
| Maximum | 3359 |
| Range | 3594 |
| Interquartile range (IQR) | 22 |
Descriptive statistics
| Standard deviation | 47.707003 |
|---|---|
| Coefficient of variation (CV) | 11.955129 |
| Kurtosis | 204.77502 |
| Mean | 3.9905049 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 9.928557 |
| Sum | 46779057 |
| Variance | 2275.9581 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -10 | 351374 | 2.9% |
| -11 | 349606 | 2.9% |
| -9 | 348180 | 2.9% |
| -12 | 346214 | 2.9% |
| -8 | 341664 | 2.8% |
| -13 | 337155 | 2.8% |
| -7 | 332418 | 2.8% |
| -14 | 325674 | 2.7% |
| -6 | 319403 | 2.7% |
| -15 | 313329 | 2.6% |
| Other values (1784) | 8357574 |
| Value | Count | Frequency (%) |
| -235 | 1 | |
| -194 | 1 | |
| -151 | 1 | |
| -148 | 1 | |
| -121 | 1 | |
| -119 | 1 | |
| -117 | 1 | |
| -108 | 1 | |
| -107 | 1 | |
| -106 | 1 |
| Value | Count | Frequency (%) |
| 3359 | 1 | |
| 3337 | 1 | |
| 3089 | 1 | |
| 3045 | 1 | |
| 3027 | 1 | |
| 2977 | 1 | |
| 2900 | 1 | |
| 2854 | 1 | |
| 2795 | 1 | |
| 2748 | 1 |
CANCELLED
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 91.5 MiB |
| 0.0 | |
|---|---|
| 1.0 | 240660 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 35974074 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 11750698 | |
| 1.0 | 240660 | 2.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 11750698 | |
| 1.0 | 240660 | 2.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 23742056 | |
| . | 11991358 | |
| 1 | 240660 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 35974074 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 23742056 | |
| . | 11991358 | |
| 1 | 240660 | 0.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 35974074 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 23742056 | |
| . | 11991358 | |
| 1 | 240660 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 35974074 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 23742056 | |
| . | 11991358 | |
| 1 | 240660 | 0.7% |
CANCELLATION_CODE
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 11750698 |
| Missing (%) | 98.0% |
| Memory size | 91.5 MiB |
| B | |
|---|---|
| A | |
| D | |
| C |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 240660 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | B |
|---|---|
| 2nd row | A |
| 3rd row | B |
| 4th row | A |
| 5th row | C |
Common Values
| Value | Count | Frequency (%) |
| B | 98334 | 0.8% |
| A | 58590 | 0.5% |
| D | 56053 | 0.5% |
| C | 27683 | 0.2% |
| (Missing) | 11750698 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| b | 98334 | |
| a | 58590 | |
| d | 56053 | |
| c | 27683 | 11.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 98334 | |
| A | 58590 | |
| D | 56053 | |
| C | 27683 | 11.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 240660 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| B | 98334 | |
| A | 58590 | |
| D | 56053 | |
| C | 27683 | 11.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 240660 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| B | 98334 | |
| A | 58590 | |
| D | 56053 | |
| C | 27683 | 11.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 240660 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| B | 98334 | |
| A | 58590 | |
| D | 56053 | |
| C | 27683 | 11.5% |
CARRIER_DELAY
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 1581 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 9870493 |
| Missing (%) | 82.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.324728 |
| Minimum | 0 |
|---|---|
| Maximum | 3359 |
| Zeros | 986724 |
| Zeros (%) | 8.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 91.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2 |
| Q3 | 20 |
| 95-th percentile | 97 |
| Maximum | 3359 |
| Range | 3359 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 66.034238 |
|---|---|
| Coefficient of variation (CV) | 2.9578966 |
| Kurtosis | 170.95033 |
| Mean | 22.324728 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 10.234107 |
| Sum | 47347735 |
| Variance | 4360.5206 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 986724 | 8.2% |
| 1 | 38989 | 0.3% |
| 2 | 38553 | 0.3% |
| 3 | 37386 | 0.3% |
| 6 | 36744 | 0.3% |
| 4 | 36368 | 0.3% |
| 5 | 34735 | 0.3% |
| 7 | 34064 | 0.3% |
| 15 | 33774 | 0.3% |
| 8 | 32314 | 0.3% |
| Other values (1571) | 811214 | 6.8% |
| (Missing) | 9870493 |
| Value | Count | Frequency (%) |
| 0 | 986724 | |
| 1 | 38989 | 0.3% |
| 2 | 38553 | 0.3% |
| 3 | 37386 | 0.3% |
| 4 | 36368 | 0.3% |
| 5 | 34735 | 0.3% |
| 6 | 36744 | 0.3% |
| 7 | 34064 | 0.3% |
| 8 | 32314 | 0.3% |
| 9 | 29920 | 0.2% |
| Value | Count | Frequency (%) |
| 3359 | 1 | |
| 3337 | 1 | |
| 3089 | 1 | |
| 3045 | 1 | |
| 3027 | 1 | |
| 2977 | 1 | |
| 2742 | 1 | |
| 2675 | 1 | |
| 2660 | 1 | |
| 2653 | 1 |
WEATHER_DELAY
Real number (ℝ)
HIGH CORRELATION  MISSING  SKEWED  ZEROS 
| Distinct | 1107 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 9870493 |
| Missing (%) | 82.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.5106841 |
| Minimum | 0 |
|---|---|
| Maximum | 2475 |
| Zeros | 1999251 |
| Zeros (%) | 16.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 91.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 7 |
| Maximum | 2475 |
| Range | 2475 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 29.157121 |
|---|---|
| Coefficient of variation (CV) | 8.3052533 |
| Kurtosis | 646.26758 |
| Mean | 3.5106841 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 20.827678 |
| Sum | 7445687 |
| Variance | 850.13768 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1999251 | 16.7% |
| 7 | 2608 | < 0.1% |
| 15 | 2600 | < 0.1% |
| 6 | 2600 | < 0.1% |
| 2 | 2497 | < 0.1% |
| 16 | 2485 | < 0.1% |
| 5 | 2436 | < 0.1% |
| 8 | 2435 | < 0.1% |
| 3 | 2435 | < 0.1% |
| 9 | 2409 | < 0.1% |
| Other values (1097) | 99109 | 0.8% |
| (Missing) | 9870493 |
| Value | Count | Frequency (%) |
| 0 | 1999251 | |
| 1 | 2324 | < 0.1% |
| 2 | 2497 | < 0.1% |
| 3 | 2435 | < 0.1% |
| 4 | 2380 | < 0.1% |
| 5 | 2436 | < 0.1% |
| 6 | 2600 | < 0.1% |
| 7 | 2608 | < 0.1% |
| 8 | 2435 | < 0.1% |
| 9 | 2409 | < 0.1% |
| Value | Count | Frequency (%) |
| 2475 | 1 | |
| 2363 | 1 | |
| 2098 | 1 | |
| 1747 | 1 | |
| 1728 | 1 | |
| 1581 | 1 | |
| 1561 | 1 | |
| 1552 | 1 | |
| 1529 | 1 | |
| 1525 | 1 |
NAS_DELAY
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 930 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 9870493 |
| Missing (%) | 82.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.868871 |
| Minimum | 0 |
|---|---|
| Maximum | 1660 |
| Zeros | 1037152 |
| Zeros (%) | 8.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 91.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 18 |
| 95-th percentile | 58 |
| Maximum | 1660 |
| Range | 1660 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 32.058563 |
|---|---|
| Coefficient of variation (CV) | 2.311548 |
| Kurtosis | 211.02129 |
| Mean | 13.868871 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 9.707252 |
| Sum | 29414004 |
| Variance | 1027.7514 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1037152 | 8.6% |
| 1 | 50704 | 0.4% |
| 15 | 44094 | 0.4% |
| 2 | 41394 | 0.3% |
| 16 | 39887 | 0.3% |
| 3 | 38763 | 0.3% |
| 4 | 36557 | 0.3% |
| 17 | 36197 | 0.3% |
| 5 | 34474 | 0.3% |
| 18 | 32663 | 0.3% |
| Other values (920) | 728980 | 6.1% |
| (Missing) | 9870493 |
| Value | Count | Frequency (%) |
| 0 | 1037152 | |
| 1 | 50704 | 0.4% |
| 2 | 41394 | 0.3% |
| 3 | 38763 | 0.3% |
| 4 | 36557 | 0.3% |
| 5 | 34474 | 0.3% |
| 6 | 32157 | 0.3% |
| 7 | 30263 | 0.3% |
| 8 | 28658 | 0.2% |
| 9 | 27082 | 0.2% |
| Value | Count | Frequency (%) |
| 1660 | 1 | |
| 1642 | 1 | |
| 1516 | 1 | |
| 1508 | 1 | |
| 1436 | 1 | |
| 1427 | 1 | |
| 1425 | 1 | |
| 1419 | 1 | |
| 1407 | 1 | |
| 1404 | 1 |
SECURITY_DELAY
Real number (ℝ)
HIGH CORRELATION  MISSING  SKEWED  ZEROS 
| Distinct | 259 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 9870493 |
| Missing (%) | 82.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.12518289 |
| Minimum | 0 |
|---|---|
| Maximum | 1183 |
| Zeros | 2111466 |
| Zeros (%) | 17.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 91.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 1183 |
| Range | 1183 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 3.6422224 |
|---|---|
| Coefficient of variation (CV) | 29.095211 |
| Kurtosis | 22780.459 |
| Mean | 0.12518289 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 112.03958 |
| Sum | 265496 |
| Variance | 13.265784 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2111466 | 17.6% |
| 15 | 420 | < 0.1% |
| 16 | 335 | < 0.1% |
| 18 | 321 | < 0.1% |
| 8 | 316 | < 0.1% |
| 17 | 313 | < 0.1% |
| 10 | 304 | < 0.1% |
| 12 | 274 | < 0.1% |
| 7 | 268 | < 0.1% |
| 11 | 267 | < 0.1% |
| Other values (249) | 6581 | 0.1% |
| (Missing) | 9870493 |
| Value | Count | Frequency (%) |
| 0 | 2111466 | |
| 1 | 200 | < 0.1% |
| 2 | 233 | < 0.1% |
| 3 | 246 | < 0.1% |
| 4 | 253 | < 0.1% |
| 5 | 239 | < 0.1% |
| 6 | 267 | < 0.1% |
| 7 | 268 | < 0.1% |
| 8 | 316 | < 0.1% |
| 9 | 265 | < 0.1% |
| Value | Count | Frequency (%) |
| 1183 | 1 | |
| 1091 | 1 | |
| 987 | 1 | |
| 983 | 1 | |
| 816 | 1 | |
| 789 | 1 | |
| 738 | 1 | |
| 691 | 1 | |
| 656 | 1 | |
| 653 | 1 |
LATE_AIRCRAFT_DELAY
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 1256 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 9870493 |
| Missing (%) | 82.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24.981462 |
| Minimum | 0 |
|---|---|
| Maximum | 2690 |
| Zeros | 1058694 |
| Zeros (%) | 8.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 91.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 30 |
| 95-th percentile | 114 |
| Maximum | 2690 |
| Range | 2690 |
| Interquartile range (IQR) | 30 |
Descriptive statistics
| Standard deviation | 52.300303 |
|---|---|
| Coefficient of variation (CV) | 2.0935646 |
| Kurtosis | 106.21905 |
| Mean | 24.981462 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 6.8471653 |
| Sum | 52982308 |
| Variance | 2735.3217 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1058694 | 8.8% |
| 15 | 26673 | 0.2% |
| 16 | 25093 | 0.2% |
| 17 | 23807 | 0.2% |
| 18 | 22821 | 0.2% |
| 19 | 21530 | 0.2% |
| 20 | 20662 | 0.2% |
| 21 | 19647 | 0.2% |
| 14 | 19419 | 0.2% |
| 13 | 19055 | 0.2% |
| Other values (1246) | 863464 | 7.2% |
| (Missing) | 9870493 |
| Value | Count | Frequency (%) |
| 0 | 1058694 | |
| 1 | 16375 | 0.1% |
| 2 | 16155 | 0.1% |
| 3 | 15670 | 0.1% |
| 4 | 15372 | 0.1% |
| 5 | 15471 | 0.1% |
| 6 | 16596 | 0.1% |
| 7 | 16394 | 0.1% |
| 8 | 17085 | 0.1% |
| 9 | 17414 | 0.1% |
| Value | Count | Frequency (%) |
| 2690 | 1 | |
| 2258 | 1 | |
| 2228 | 1 | |
| 2098 | 1 | |
| 2096 | 1 | |
| 2093 | 1 | |
| 2088 | 1 | |
| 2010 | 1 | |
| 2006 | 1 | |
| 1926 | 1 |
MMYYYY
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 120 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 201906.46 |
| Minimum | 201407 |
|---|---|
| Maximum | 202406 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 91.5 MiB |
Quantile statistics
| Minimum | 201407 |
|---|---|
| 5-th percentile | 201412 |
| Q1 | 201612 |
| median | 201906 |
| Q3 | 202201 |
| 95-th percentile | 202401 |
| Maximum | 202406 |
| Range | 999 |
| Interquartile range (IQR) | 589 |
Descriptive statistics
| Standard deviation | 291.12048 |
|---|---|
| Coefficient of variation (CV) | 0.0014418582 |
| Kurtosis | -1.1631919 |
| Mean | 201906.46 |
| Median Absolute Deviation (MAD) | 294 |
| Skewness | 0.00042383716 |
| Sum | 2.4211327 × 1012 |
| Variance | 84751.136 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 201410 | 100000 | 0.8% |
| 201609 | 100000 | 0.8% |
| 202105 | 100000 | 0.8% |
| 202111 | 100000 | 0.8% |
| 202311 | 100000 | 0.8% |
| 201711 | 99999 | 0.8% |
| 201611 | 99999 | 0.8% |
| 201705 | 99999 | 0.8% |
| 202002 | 99999 | 0.8% |
| 202303 | 99999 | 0.8% |
| Other values (110) | 10991363 |
| Value | Count | Frequency (%) |
| 201407 | 99989 | |
| 201408 | 99996 | |
| 201409 | 99990 | |
| 201410 | 100000 | |
| 201411 | 99996 | |
| 201412 | 99995 | |
| 201501 | 99861 | |
| 201502 | 99844 | |
| 201503 | 99953 | |
| 201504 | 99996 |
| Value | Count | Frequency (%) |
| 202406 | 99996 | |
| 202405 | 99998 | |
| 202404 | 99999 | |
| 202403 | 99996 | |
| 202402 | 99996 | |
| 202401 | 99947 | |
| 202312 | 99997 | |
| 202311 | 100000 | |
| 202310 | 99999 | |
| 202309 | 99997 |
| ARR_DELAY | ARR_TIME | CANCELLATION_CODE | CANCELLED | CARRIER_DELAY | DEP_DELAY | DEP_TIME | DEST_AIRPORT_ID | DEST_AIRPORT_SEQ_ID | DEST_CITY_MARKET_ID | LATE_AIRCRAFT_DELAY | MMYYYY | NAS_DELAY | OP_CARRIER | OP_UNIQUE_CARRIER | ORIGIN_AIRPORT_ID | ORIGIN_AIRPORT_SEQ_ID | ORIGIN_CITY_MARKET_ID | SECURITY_DELAY | TAXI_IN | TAXI_OUT | WEATHER_DELAY | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ARR_DELAY | 1.000 | 0.114 | 0.000 | 1.000 | 0.191 | 0.650 | 0.162 | 0.017 | 0.017 | 0.022 | 0.356 | -0.034 | 0.010 | 0.018 | 0.018 | -0.007 | -0.008 | -0.029 | -0.008 | 0.110 | 0.264 | 0.128 |
| ARR_TIME | 0.114 | 1.000 | 0.000 | 1.000 | -0.065 | 0.144 | 0.758 | 0.020 | 0.020 | 0.047 | 0.140 | -0.007 | -0.015 | 0.046 | 0.046 | -0.004 | -0.004 | -0.044 | -0.002 | -0.026 | 0.025 | 0.002 |
| CANCELLATION_CODE | 0.000 | 0.000 | 1.000 | 1.000 | 0.000 | 0.048 | 0.165 | 0.065 | 0.065 | 0.092 | 0.000 | 0.600 | 0.000 | 0.287 | 0.287 | 0.071 | 0.071 | 0.095 | 0.000 | 0.000 | 0.066 | 0.000 |
| CANCELLED | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.024 | 0.010 | 0.014 | 0.014 | 0.018 | 1.000 | 0.264 | 1.000 | 0.053 | 0.053 | 0.012 | 0.012 | 0.018 | 1.000 | 1.000 | 0.004 | 1.000 |
| CARRIER_DELAY | 0.191 | -0.065 | 0.000 | 1.000 | 1.000 | 0.299 | -0.015 | 0.004 | 0.005 | 0.032 | -0.224 | 0.057 | -0.374 | 0.020 | 0.020 | -0.046 | -0.046 | -0.056 | -0.056 | -0.118 | -0.141 | -0.220 |
| DEP_DELAY | 0.650 | 0.144 | 0.048 | 0.024 | 0.299 | 1.000 | 0.205 | 0.011 | 0.010 | 0.024 | 0.468 | -0.025 | -0.380 | 0.017 | 0.017 | -0.032 | -0.033 | -0.066 | 0.001 | -0.048 | 0.023 | 0.108 |
| DEP_TIME | 0.162 | 0.758 | 0.165 | 0.010 | -0.015 | 0.205 | 1.000 | 0.030 | 0.030 | 0.064 | 0.290 | -0.003 | -0.133 | 0.045 | 0.045 | -0.035 | -0.035 | -0.054 | -0.004 | -0.067 | 0.003 | 0.013 |
| DEST_AIRPORT_ID | 0.017 | 0.020 | 0.065 | 0.014 | 0.004 | 0.011 | 0.030 | 1.000 | 1.000 | 0.619 | -0.011 | -0.007 | -0.003 | 0.178 | 0.178 | 0.013 | 0.013 | -0.011 | -0.003 | -0.127 | 0.023 | -0.012 |
| DEST_AIRPORT_SEQ_ID | 0.017 | 0.020 | 0.065 | 0.014 | 0.005 | 0.010 | 0.030 | 1.000 | 1.000 | 0.619 | -0.011 | 0.001 | -0.003 | 0.178 | 0.178 | 0.013 | 0.013 | -0.011 | -0.003 | -0.127 | 0.023 | -0.012 |
| DEST_CITY_MARKET_ID | 0.022 | 0.047 | 0.092 | 0.018 | 0.032 | 0.024 | 0.064 | 0.619 | 0.619 | 1.000 | -0.013 | 0.003 | -0.015 | 0.158 | 0.158 | -0.011 | -0.011 | -0.059 | -0.001 | -0.235 | 0.055 | -0.007 |
| LATE_AIRCRAFT_DELAY | 0.356 | 0.140 | 0.000 | 1.000 | -0.224 | 0.468 | 0.290 | -0.011 | -0.011 | -0.013 | 1.000 | -0.015 | -0.329 | 0.016 | 0.016 | 0.027 | 0.027 | 0.032 | -0.016 | -0.094 | -0.217 | -0.036 |
| MMYYYY | -0.034 | -0.007 | 0.600 | 0.264 | 0.057 | -0.025 | -0.003 | -0.007 | 0.001 | 0.003 | -0.015 | 1.000 | -0.057 | 0.108 | 0.108 | -0.007 | 0.001 | 0.003 | 0.019 | 0.023 | 0.046 | 0.007 |
| NAS_DELAY | 0.010 | -0.015 | 0.000 | 1.000 | -0.374 | -0.380 | -0.133 | -0.003 | -0.003 | -0.015 | -0.329 | -0.057 | 1.000 | 0.020 | 0.020 | 0.008 | 0.007 | 0.024 | -0.015 | 0.285 | 0.468 | -0.010 |
| OP_CARRIER | 0.018 | 0.046 | 0.287 | 0.053 | 0.020 | 0.017 | 0.045 | 0.178 | 0.178 | 0.158 | 0.016 | 0.108 | 0.020 | 1.000 | 1.000 | 0.178 | 0.178 | 0.158 | 0.003 | 0.002 | 0.050 | 0.013 |
| OP_UNIQUE_CARRIER | 0.018 | 0.046 | 0.287 | 0.053 | 0.020 | 0.017 | 0.045 | 0.178 | 0.178 | 0.158 | 0.016 | 0.108 | 0.020 | 1.000 | 1.000 | 0.178 | 0.178 | 0.158 | 0.003 | 0.002 | 0.050 | 0.013 |
| ORIGIN_AIRPORT_ID | -0.007 | -0.004 | 0.071 | 0.012 | -0.046 | -0.032 | -0.035 | 0.013 | 0.013 | -0.011 | 0.027 | -0.007 | 0.008 | 0.178 | 0.178 | 1.000 | 1.000 | 0.619 | -0.007 | 0.044 | -0.034 | -0.026 |
| ORIGIN_AIRPORT_SEQ_ID | -0.008 | -0.004 | 0.071 | 0.012 | -0.046 | -0.033 | -0.035 | 0.013 | 0.013 | -0.011 | 0.027 | 0.001 | 0.007 | 0.178 | 0.178 | 1.000 | 1.000 | 0.619 | -0.007 | 0.045 | -0.034 | -0.026 |
| ORIGIN_CITY_MARKET_ID | -0.029 | -0.044 | 0.095 | 0.018 | -0.056 | -0.066 | -0.054 | -0.011 | -0.011 | -0.059 | 0.032 | 0.003 | 0.024 | 0.158 | 0.158 | 0.619 | 0.619 | 1.000 | 0.002 | 0.112 | -0.061 | -0.032 |
| SECURITY_DELAY | -0.008 | -0.002 | 0.000 | 1.000 | -0.056 | 0.001 | -0.004 | -0.003 | -0.003 | -0.001 | -0.016 | 0.019 | -0.015 | 0.003 | 0.003 | -0.007 | -0.007 | 0.002 | 1.000 | -0.004 | -0.011 | -0.015 |
| TAXI_IN | 0.110 | -0.026 | 0.000 | 1.000 | -0.118 | -0.048 | -0.067 | -0.127 | -0.127 | -0.235 | -0.094 | 0.023 | 0.285 | 0.002 | 0.002 | 0.044 | 0.045 | 0.112 | -0.004 | 1.000 | 0.060 | -0.004 |
| TAXI_OUT | 0.264 | 0.025 | 0.066 | 0.004 | -0.141 | 0.023 | 0.003 | 0.023 | 0.023 | 0.055 | -0.217 | 0.046 | 0.468 | 0.050 | 0.050 | -0.034 | -0.034 | -0.061 | -0.011 | 0.060 | 1.000 | 0.071 |
| WEATHER_DELAY | 0.128 | 0.002 | 0.000 | 1.000 | -0.220 | 0.108 | 0.013 | -0.012 | -0.012 | -0.007 | -0.036 | 0.007 | -0.010 | 0.013 | 0.013 | -0.026 | -0.026 | -0.032 | -0.015 | -0.004 | 0.071 | 1.000 |